A Supervised Statistical Learning Approach for Accurate Legionella pneumophila Source Attribution during Outbreaks.
نویسندگان
چکیده
Public health agencies are increasingly relying on genomics during Legionnaires' disease investigations. However, the causative bacterium (Legionella pneumophila) has an unusual population structure, with extreme temporal and spatial genome sequence conservation. Furthermore, Legionnaires' disease outbreaks can be caused by multiple L. pneumophila genotypes in a single source. These factors can confound cluster identification using standard phylogenomic methods. Here, we show that a statistical learning approach based on L. pneumophila core genome single nucleotide polymorphism (SNP) comparisons eliminates ambiguity for defining outbreak clusters and accurately predicts exposure sources for clinical cases. We illustrate the performance of our method by genome comparisons of 234 L. pneumophila isolates obtained from patients and cooling towers in Melbourne, Australia, between 1994 and 2014. This collection included one of the largest reported Legionnaires' disease outbreaks, which involved 125 cases at an aquarium. Using only sequence data from L. pneumophila cooling tower isolates and including all core genome variation, we built a multivariate model using discriminant analysis of principal components (DAPC) to find cooling tower-specific genomic signatures and then used it to predict the origin of clinical isolates. Model assignments were 93% congruent with epidemiological data, including the aquarium Legionnaires' disease outbreak and three other unrelated outbreak investigations. We applied the same approach to a recently described investigation of Legionnaires' disease within a UK hospital and observed a model predictive ability of 86%. We have developed a promising means to breach L. pneumophila genetic diversity extremes and provide objective source attribution data for outbreak investigations.IMPORTANCE Microbial outbreak investigations are moving to a paradigm where whole-genome sequencing and phylogenetic trees are used to support epidemiological investigations. It is critical that outbreak source predictions are accurate, particularly for pathogens, like Legionella pneumophila, which can spread widely and rapidly via cooling system aerosols, causing Legionnaires' disease. Here, by studying hundreds of Legionella pneumophila genomes collected over 21 years around a major Australian city, we uncovered limitations with the phylogenetic approach that could lead to a misidentification of outbreak sources. We implement instead a statistical learning technique that eliminates the ambiguity of inferring disease transmission from phylogenies. Our approach takes geolocation information and core genome variation from environmental L. pneumophila isolates to build statistical models that predict with high confidence the environmental source of clinical L. pneumophila during disease outbreaks. We show the versatility of the technique by applying it to unrelated Legionnaires' disease outbreaks in Australia and the UK.
منابع مشابه
A single Legionella pneumophila genotype in the freshwater system in a ship experiencing three separate outbreaks of legionellosis in 6 years
BACKGROUND Recurrent legionella outbreaks at one and the same location are common. We have identified a single Legionella pneumophila genotype associated with recurrent Legionella outbreaks over 6 years. METHODS Field emergency surveys following Legionella outbreaks were performed on a vessel in 2008, 2009 and 2013. Water samples from both the distribution and technical parts of the potable w...
متن کاملClinical application of a multiplex real-time PCR assay for simultaneous detection of Legionella species, Legionella pneumophila, and Legionella pneumophila serogroup 1.
We developed a single-tube multiplex real-time PCR assay capable of simultaneously detecting and discriminating Legionella spp., Legionella pneumophila, and Legionella pneumophila serogroup 1 in primary specimens. Evaluation of 21 clinical specimens and 115 clinical isolates demonstrated this assay to be a rapid, high-throughput diagnostic test with 100% specificity that may aid during legionel...
متن کاملفراوانی لژیونلا پنوموفیلا در شیر آب سرد و گرم و مخزن آب انکوباتورهای بخش نوزادان بیمارستانهای گیلان
Background and purpose: Nosocomial outbreaks of legionnaires’ diseases are usually related to contamination of water sources. This survey investigated the frequency of mip gene in cold and warm water taps and water containers of infant incubators containing legionella pneumophila in hospitals of Guilan province, Iran. Materials and methods: This cross-sectional study used 140 samples. They wer...
متن کاملEnvironmental survey of Legionella pneumophila in hot springs in Taiwan.
Acquisition of sporadic community-acquired legionnaires' disease has been linked to hot springs and whirlpool baths. Outbreaks of hot spring-associated legionnaires' disease were reported in Japan in the last few years. Although the mode of transmission is unclear, the presence of Legionella in hot springs may discourage hot springs resort visits by the general public. An environmental survey w...
متن کاملThe Different Antibacterial Impact of Silver Nanoparticles Against Legionella pneumophila Compared to Other Microorganisms
Legionella pneumophila is the pathogen responsible for severe pneumonia known as Legionnaires’ disease. Legionella can live under varied stress conditions, especially in cold environments, and is common in many artificial environments. In this study, the antimicrobial activity of biogenic silver nanoparticles, prepared using the culture supernatant of Klebsiella pneumoniae, was evaluated agains...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- Applied and environmental microbiology
دوره 83 21 شماره
صفحات -
تاریخ انتشار 2017